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Human African trypanosomiasis (HAT), 
also known as sleeping sickness, is a neglected 
disease that impacts 70 million people 
distributed over 1.55 million km 2 in sub- 
Saharan Africa and includes at least 50% of 
the population of the Democratic Republic 
of the Congo [1]. Trypanosoma brucei 
gambiense accounts for more than 98% of 
the infections in central and West Africa, the 
remaining infections being from Trypanoso- 
ma brucei rhodesiense in East Africa [2]. The 
parasites are transmitted to the hosts through 
the bite of an infected tsetse fly. Disease 
control is challenging as there are no 
vaccines, and effective, easily delivered drugs 
are still lacking. Treatment invariably in- 
volves lengthy hospitalization, with both 
medical and socioeconomic consequences. 
Control of disease can be accomplished, 
however, through vector control, which 
largely to date has aimed to reduce insect 
populations rather than eliminate them. In 
the mid- 1 990s, disease cases were increasing 
but tsetse research and facilities that could 
maintain tsetse fly colonies were on a decline 
globally, particularly in Africa. This was also 
at a time when new scientific advances were 
being realized in other vector-borne disease 
systems, particularly building on the revolu- 
tion in genome sequencing and genome- 
wide analyses. 

In 2004, the Molecular Entomology 
program of the World Health Organiza- 
tion Tropical Diseases Research (WHO- 
TDR) unit began an initiative (Interna- 
tional Glossina Genome Initiative, IGGI) 
that brought together an interdisciplinary 
group of researchers from multiple coun- 
tries and institutions to explore possibilities 
for genomics and post-genomics research 
in the Tsetse field. The first major goal of 
IGGI was to produce a reference genome 
sequence for single Glossina species — 
work that is described in [3]. However, 
alongside this direct output, there were 
two further goals for this initiative. First, to 
understand the genomic basis of unique 
aspects of tsetse biology with a focus on 



applied discoveries, those that could be 
exploited to aid in vector control; and 
second, to use this network to help build 
global capacity, in Africa and globally, for 
genetics and genomics-based research into 
tsetse. This vision was crystalized at the 
outset with a decision to generate sustain- 
able genomic resources on the African 
continent, and in turn to use these 
resources to provide training in genome 
analytical skills (Figure 1). The IGGI 
consortium met annually to organize 
training workshops, to initiate transcrip- 
tomics-based projects, to recruit financial 
resources to undertake the WGS initiative 
and to track progress once the project was 
successfully launched. Wellcome Trust 
funding for the WGS project provided 
the much-needed momentum to ensure 
the project remained on track. 

Data Curation as a Model for 
Genomics Training 

In lieu of a completely sequenced 
Glossina genome, the consortium recruit- 
ed global funds that enabled the develop- 
ment of a molecular toolbox, which 
initially included the production of ex- 
pressed sequence tags (ESTs) — sequences 
produced from the ends of cloned 
cDNA fragments — from 1 1 large cDNA 
libraries, along with the construction and 



sequencing of a bacterial artificial chro- 
mosome (BAC) library. The molecular 
toolbox became the training instrument 
for five ten-day bioinformatics workshops 
held at the University of the Western 
Cape, South Africa, involving a total of 
75 students and researchers from 18 
African countries (Table 1). Although, 
by their very nature, the EST libraries 
only offered a partial and highly frag- 
mented picture of the genome, they were 
nonetheless an opportunity to gain insight 
into the molecular mechanisms and 
processes that govern vectorial capacity 
of Glossina [4—9]. These workshops 
provided an environment for students to 
interact with international experts from 
Yale University, European Bioinformatics 
Institute, Wellcome Trust Sanger Insti- 
tute, Liverpool School of Tropical Med- 
icine, and RIKEN. The manual annota- 
tion exercise of the Glossina 
transcriptome resulted in a curated Glos- 
sina dataset (www.vectorbase.org) and 
afforded graduate students an opportunity 
to acquire a skill set that could be 
transferred to other sequencing projects. 
At least 20% of participants have subse- 
quently forged international collabora- 
tions with these trainers or have, them- 
selves, become sought after as genomics 
trainers and supervisors of graduate 
students. 
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International Student Exchange 
Programs 

The IGGI consortium held annual 
executive meetings, which either coincided 
with workshop activities or at scientific 
meetings where many members were 
present. During the annual IGGI consor- 
tium meeting in November 2009, the 
tsetse biological laboratories from within 
the consortium established themselves as 
remote mentoring sites for the Glossina 
Functional Genomics Network. This en- 
abled three African students to participate 
in a two-month exchange program hosted 
annually in laboratories in the United 
States and Europe. These graduate stu- 
dents gained additional skills in genomics 
and functional genomics areas, and the 
visits provided them with an established 
scientific network for academic support 
while they continued their graduate stud- 
ies. The first cohort of the exchange 
program was subsequently funded to 
present their work at the bi-annual African 
Society for Bioinformatics and Computa- 
tional Biology meeting in Cape Town, 
South Africa in 201 1. These students have 
afterwards either completed their PhD 
degrees or are in their final stages of doing 
so. Four additional centers on the African 
continent were identified as partners 



within the Glossina Functional Genomics 
Network — in Malawi, Zambia, Sudan, 
and Uganda. 

The skills development funding ob- 
tained from the WHO-TDR program 
was maximized to create a research- 
enabling environment in tsetse-endemic 
countries by supporting recipients of the 
exchange programs. To this end, in 
collaboration with Professor Erik Bong- 
cam-Rudloff, Uppsala University, the 
EBIOKIT was adopted for African tsetse 
research laboratories to carry out genomic 
analyses locally without the need to rely on 
internet connectivity. The bioinformatics 
computing environment was rolled out in 
Kenya, where 44 participants received 
hands-on training to use the myriad of 
bioinformatics analytical tools embedded 
within the EBIOKIT. This international 
collaborative training session impacted 
Egerton University's Faculty of Agricul- 
ture, Jomo Kenyatta University of Science 
and Technology, Moi University, Kenya 
Wildlife Services, and the Kenyan Minis- 
try of Higher Education. 

African Institution Partnerships 

The impact of the IGGI consortium 
activities over the past eight years can be 
measured by the capacity development 



activities afforded to both students and 
junior researchers in Africa. The IGGI 
activities have been a catalyst for graduate 
training in genomics on the African 
continent. Notwithstanding the training 
workshops, at least ten MSc and 12 PhD 
students were trained as a direct conse- 
quence of access to Glossina genome data 
(Table SI). Having an established network 
allowed some of the African participants to 
gather additional research funds from the 
WHO-TDR Entomology Committee to 
begin a population genetics investigation 
in the Lake Victoria Basin. This initiative 
further enabled the network to recruit 
three PhD students to be trained in 
genomics research. Similarly, the Yale 
scientists within IGGI were successful in 
securing funds from the Fogarty Interna- 
tional Center to help support capacity in 
tsetse research in East Africa. These funds 
allowed the network participants to orga- 
nize workshop activities in Kenya and 
Uganda focusing on population genetics/ 
genomics, bioinformatics, and functional 
genomics fields for tsetse researchers and 
students. 

Researchers, on the other hand, who 
have been part of the IGGI consortium, 
have facilitated short courses at their 
local institutions; been invited to act as 
reviewers for international journals; and 
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Figure 1. Glossina molecular toolbox to facilitate genomics training. The international Glossina Genome Initiative developed genomic 
resources including ESTs, a BAC library, and the sequenced Glossina morsitans mortisans genome to facilitate genomic capacity development on the 
African continent. 
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won competitive grants from IAEA, 
WHO-TDR, BecA/NEPAD, National 
Institutes of Health (NIH), Fogarty 
International Center of NIH, and the 
South African National Research Foun- 
dation. More recently, networks have 
been established to enable graduate 
students to be cosupervised across Afri- 
can institutions where experimental 
work is being carried out at Interna- 
tional Centre of Insect Physiology and 
Ecology (ICIPE) and the Trypanosomi- 
asis Research Center (TRC) in Kenya, 
Gulu University and Makerere Univer- 
sity in Uganda, as well as at Yale 
University in the US, and the bioinfor- 
matics analysis and training is conduct- 
ed at the South African National 
Bioinformatics Institute in South 
Africa. 
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ship and long term commitment. Access 
to the human genome sequence, the 
trypanosomatid genomes [10-13], and 
trypanosome 'omics data [14,15] has 
provided insight into host-parasite in- 
teractions and the identification of new 
vaccine candidates or chemotherapeutic 
targets. Targeting tsetse-trypanosome 
interactions with the goal of identifying 
genes that modulate trypanosome trans- 
mission has entered the realm of feasi- 
bility with the completion of the tsetse 
genome sequencing project. 
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